<400> 1

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 2460 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520
cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 
gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 
gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 
catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760
tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 
ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880
tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940
ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000
aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060
gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120
tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180
acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240
cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300
cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360
tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420

tatacat atg ggg gac gct ccc agc cct gaa gag aaa ctg cac ctt atc 3469
Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu Ile
1 5 10

acc cgg aac ctg cag gag gtt ctg ggg gaa gag aag ctg aag gag ata 3517
Thr Arg Asn Leu Gln Glu Val Leu Gly Glu Glu Lys Leu Lys Glu Ile
15 20 25 30

ctg aag gag cgg gaa ctt aaa att tac tgg gga acg gca acc acg ggc 3565
Leu Lys Glu Arg Glu Leu Lys Ile Tyr Trp Gly Thr Ala Thr Thr Gly
35 40 45

aaa cca cat gtg gct tac ttt gtg ccc atg tca aag att gca gac ttc 3613
Lys Pro His Val Ala Tyr Phe Val Pro Met Ser Lys Ile Ala Asp Phe
50 55 60

tta aag gca ggg tgt gag gta aca att ctg ttt gcg gac ctc cac gca 3661
Leu Lys Ala Gly Cys Glu Val Thr Ile Leu Phe Ala Asp Leu His Ala
65 70 75

tac ctg gat aac atg aaa gcc cca tgg gaa ctt cta gaa ctc cga gtc 3709
Tyr Leu Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Arg Val
80 85 90

agt tac tat gag aat gtg atc aaa gca atg ctg gag agc att ggt gtg 3757
Ser Tyr Tyr Glu Asn Val Ile Lys Ala Met Leu Glu Ser Ile Gly Val
95 100 105 110

ccc ttg gag aag ctc aag ttc atc aaa ggc act gat tac cag ctc agc 3805
Pro Leu Glu Lys Leu Lys Phe Ile Lys Gly Thr Asp Tyr Gln Leu Ser
115 120 125

aaa gag tac aca cta gat gtg tac aga ctc tcc tcc gtg gtc aca cag 3853
Lys Glu Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gln
130 135 140

cac gat tcc aag aag gct gga gct gag gtg gta aag cag gtg gag cac 3901
His Asp Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gln Val Glu His
145 150 155

cct ttg ctg agt ggc ctc tta tac ccc gga ctg cag gct ttg gat gaa 3949
Pro Leu Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gln Ala Leu Asp Glu
160 165 170

gag tat tta aaa gta gat gcc caa ttt gga ggc att gat cag aga aag 3997
Glu Tyr Leu Lys Val Asp Ala Gln Phe Gly Gly Ile Asp Gln Arg Lys
175 180 185 190

att ttc acc ttt gca gag aag tac ctc cct gca ctt ggc tat tca aaa 4045
Ile Phe Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys
195 200 205

cgg gtc cat ctg atg aat cct atg gtt cca gga tta aca ggc agc aaa 4093
Arg Val His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys
210 215 220

atg agc tct tca gaa gag gag tcc aag att gat ctc ctt gat cgg aag 4141
Met Ser Ser Ser Glu Glu Glu Ser Lys Ile Asp Leu Leu Asp Arg Lys
225 230 235

gag gat gtg aag aaa aaa ctg aag aag gcc ttc tgt gag cca gga aat 4189
Glu Asp Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn
240 245 250

gtg gag aac aat ggg gtt ctg tcc ttc atc aag cat gtc ctt ttt ccc 4237
Val Glu Asn Asn Gly Val Leu Ser Phe Ile Lys His Val Leu Phe Pro
255 260 265 270

ctt aag tcc gag ttt gtg atc cta cga gat gag aaa tgg ggt gga aac 4285
Leu Lys Ser Glu Phe Val Ile Leu Arg Asp Glu Lys Trp Gly Gly Asn
275 280 285

aaa acc tac aca gct tac gtg gac ctg gaa aag gac ttt gct gct gag 4333
Lys Thr Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu
290 295 300

gtt gta cat cct gga gac ctg aag aat tct gtt gaa gtc gca ctg aac 4381
Val Val His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn
305 310 315

aag ttg ctg gat cca atc cgg gaa aag ttt aat acc cct gcc ctg aaa 4429
Lys Leu Leu Asp Pro Ile Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys
320 325 330

aaa ctg gcc agc gct gcc tac cca gat ccc tca aag cag aag cca atg 4477
Lys Leu Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gln Lys Pro Met
335 340 345 350

gcc aaa ggc cct gcc aag aat tca gaa cca gag gag gtc atc cca tcc 4525
Ala Lys Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val Ile Pro Ser
355 360 365

cgg ctg gat atc cgt gtg ggg aaa atc atc act gtg gag aag cac cca 4573
Arg Leu Asp Ile Arg Val Gly Lys Ile Ile Thr Val Glu Lys His Pro
370 375 380

gat gca gac agc ctg tat gta gag aag att gac gtg ggg gaa gct gaa 4621
Asp Ala Asp Ser Leu Tyr Val Glu Lys Ile Asp Val Gly Glu Ala Glu
385 390 395

cca cgg act gtg gtg agc ggc ctg gta cag ttc gtg ccc aag gag gaa 4669
Pro Arg Thr Val Val Ser Gly Leu Val Gln Phe Val Pro Lys Glu Glu
400 405 410

ctg cag gac agg ctg gta gtg gtg ctg tgc aac ctg aaa ccc cag aag 4717
Leu Gln Asp Arg Leu Val Val Val Leu Cys Asn Leu Lys Pro Gln Lys
415 420 425 430

atg aga gga gtc gag tcc caa ggc atg ctt ctg tgt gct tct ata gaa 4765
Met Arg Gly Val Glu Ser Gln Gly Met Leu Leu Cys Ala Ser Ile Glu
435 440 445

ggg ata aac cgc cag gtt gaa cct ctg gac cct ccg gca ggc tct gct 4813
Gly Ile Asn Arg Gln Val Glu Pro Leu Asp Pro Pro Ala Gly Ser Ala
450 455 460

cct ggt gag cac gtg ttt gtg aag ggc tat gaa aag ggc caa cca gat 4861
Pro Gly Glu His Val Phe Val Lys Gly Tyr Glu Lys Gly Gln Pro Asp
465 470 475

gag gag ctc aag ccc aag aag aaa gtc ttc gag aag ttg cag gct gac 4909
Glu Glu Leu Lys Pro Lys Lys Lys Val Phe Glu Lys Leu Gln Ala Asp
480 485 490

ttc aaa att tct gag gag tgc atc gca cag tgg aag caa acc aac ttc 4957
Phe Lys Ile Ser Glu Glu Cys Ile Ala Gln Trp Lys Gln Thr Asn Phe
495 500 505 510

atg acc aag ctg ggc tcc att tcc tgt aaa tcg ctg aaa ggg ggg aac 5005
Met Thr Lys Leu Gly Ser Ile Ser Cys Lys Ser Leu Lys Gly Gly Asn
515 520 525

att agc ctc gag cac cac cac cac cac cac tgagatccgg ctgctaacaa 5055
Ile Ser Leu Glu His His His His His His
530 535

agcccgaaag gaagctgagt tggctgctgc caccgctgag caataactag cataacccct 5115

tggggcctct aaacgggtct tgaggggttt tttgctgaaa ggaggaacta tatccggat 5174



<210> 2 
<211> 536 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human 
full-length TyrRS in pET20B 



<400> 2 

Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu Ile Thr Arg
1 5 10 15

Asn Leu Gln Glu Val Leu Gly Glu Glu Lys Leu Lys Glu Ile Leu Lys
20 25 30

Glu Arg Glu Leu Lys Ile Tyr Trp Gly Thr Ala Thr Thr Gly Lys Pro
35 40 45

His Val Ala Tyr Phe Val Pro Met Ser Lys Ile Ala Asp Phe Leu Lys
50 55 60

Ala Gly Cys Glu Val Thr Ile Leu Phe Ala Asp Leu His Ala Tyr Leu
65 70 75 80 

Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Arg Val Ser Tyr
85 90 95

Tyr Glu Asn Val Ile Lys Ala Met Leu Glu Ser Ile Gly Val Pro Leu
100 105 110

Glu Lys Leu Lys Phe Ile Lys Gly Thr Asp Tyr Gln Leu Ser Lys Glu
115 120 125

Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gln His Asp
130 135 140

Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gln Val Glu His Pro Leu
145 150 155 160

Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gln Ala Leu Asp Glu Glu Tyr
165 170 175

Leu Lys Val Asp Ala Gln Phe Gly Gly Ile Asp Gln Arg Lys Ile Phe
180 185 190

Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys Arg Val
195 200 205

His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys Met Ser
210 215 220

Ser Ser Glu Glu Glu Ser Lys Ile Asp Leu Leu Asp Arg Lys Glu Asp
225 230 235 240

Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn Val Glu
245 250 255

Asn Asn Gly Val Leu Ser Phe Ile Lys His Val Leu Phe Pro Leu Lys
260 265 270

Ser Glu Phe Val Ile Leu Arg Asp Glu Lys Trp Gly Gly Asn Lys Thr
275 280 285

Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu Val Val
290 295 300

His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn Lys Leu
305 310 315 320

Leu Asp Pro Ile Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys Lys Leu
325 330 335

Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gln Lys Pro Met Ala Lys
340 345 350

Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val Ile Pro Ser Arg Leu
355 360 365

Asp Ile Arg Val Gly Lys Ile Ile Thr Val Glu Lys His Pro Asp Ala
370 375 380

Asp Ser Leu Tyr Val Glu Lys Ile Asp Val Gly Glu Ala Glu Pro Arg
385 390 395 400

Thr Val Val Ser Gly Leu Val Gln Phe Val Pro Lys Glu Glu Leu Gln
405 410 415

Asp Arg Leu Val Val Val Leu Cys Asn Leu Lys Pro Gln Lys Met Arg
420 425 430

Gly Val Glu Ser Gln Gly Met Leu Leu Cys Ala Ser Ile Glu Gly Ile
435 440 445

Asn Arg Gln Val Glu Pro Leu Asp Pro Pro Ala Gly Ser Ala Pro Gly
450 455 460

Glu His Val Phe Val Lys Gly Tyr Glu Lys Gly Gln Pro Asp Glu Glu
465 470 475 480

Leu Lys Pro Lys Lys Lys Val Phe Glu Lys Leu Gln Ala Asp Phe Lys
485 490 495

Ile Ser Glu Glu Cys Ile Ala Gln Trp Lys Gln Thr Asn Phe Met Thr
500 505 510

Lys Leu Gly Ser Ile Ser Cys Lys Ser Leu Lys Gly Gly Asn Ile Ser
515 520 525

Leu Glu His His His His His His
530 535 



<210> 3 
<211> 4682 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428)..(4543)
<220> 

<223> Description of Artificial Sequence: human mini 
TyrRS in pET20B 

<400> 3 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 2460
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520

cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 

gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 

gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 

catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760 

tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 

ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 

tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 

ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 

aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 

gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 

tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 

acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 

cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 

cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 

tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 

tatacat atg ggg gac gct ccc agc cct gaa gag aaa ctg cac ctt atc 3469
Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu Ile
1 5 10

acc cgg aac ctg cag gag gtt ctg ggg gaa gag aag ctg aag gag ata 3517
Thr Arg Asn Leu Gln Glu Val Leu Gly Glu Glu Lys Leu Lys Glu Ile
15 20 25 30

ctg aag gag cgg gaa ctt aaa att tac tgg gga acg gca acc acg ggc 3565
Leu Lys Glu Arg Glu Leu Lys Ile Tyr Trp Gly Thr Ala Thr Thr Gly
35 40 45

aaa cca cat gtg gct tac ttt gtg ccc atg tca aag att gca gac ttc 3613
Lys Pro His Val Ala Tyr Phe Val Pro Met Ser Lys Ile Ala Asp Phe
50 55 60

tta aag gca ggg tgt gag gta aca att ctg ttt gcg gac ctc cac gca 3661
Leu Lys Ala Gly Cys Glu Val Thr Ile Leu Phe Ala Asp Leu His Ala
65 70 75

tac ctg gat aac atg aaa gcc cca tgg gaa ctt cta gaa ctc cga gtc 3709
Tyr Leu Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Arg Val
80 85 90

agt tac tat gag aat gtg atc aaa gca atg ctg gag agc att ggt gtg 3757
Ser Tyr Tyr Glu Asn Val Ile Lys Ala Met Leu Glu Ser Ile Gly Val
95 100 105 110

ccc ttg gag aag ctc aag ttc atc aaa ggc act gat tac cag ctc agc 3805
Pro Leu Glu Lys Leu Lys Phe Ile Lys Gly Thr Asp Tyr Gln Leu Ser
115 120 125

aaa gag tac aca cta gat gtg tac aga ctc tcc tcc gtg gtc aca cag 3853
Lys Glu Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gln
130 135 140

cac gat tcc aag aag gct gga gct gag gtg gta aag cag gtg gag cac 3901
His Asp Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gln Val Glu His
145 150 155

cct ttg ctg agt ggc ctc tta tac ccc gga ctg cag gct ttg gat gaa 3949
Pro Leu Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gln Ala Leu Asp Glu
160 165 170

gag tat tta aaa gta gat gcc caa ttt gga ggc att gat cag aga aag 3997
Glu Tyr Leu Lys Val Asp Ala Gln Phe Gly Gly Ile Asp Gln Arg Lys
175 180 185 190

att ttc acc ttt gca gag aag tac ctc cct gca ctt ggc tat tca aaa 4045
Ile Phe Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys
195 200 205

cgg gtc cat ctg atg aat cct atg gtt cca gga tta aca ggc agc aaa 4093
Arg Val His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys
210 215 220

atg agc tct tca gaa gag gag tcc aag att gat ctc ctt gat cgg aag 4141
Met Ser Ser Ser Glu Glu Glu Ser Lys Ile Asp Leu Leu Asp Arg Lys
225 230 235

gag gat gtg aag aaa aaa ctg aag aag gcc ttc tgt gag cca gga aat 4189
Glu Asp Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn
240 245 250

gtg gag aac aat ggg gtt ctg tcc ttc atc aag cat gtc ctt ttt ccc 4237
Val Glu Asn Asn Gly Val Leu Ser Phe Ile Lys His Val Leu Phe Pro
255 260 265 270

ctt aag tcc gag ttt gtg atc cta cga gat gag aaa tgg ggt gga aac 4285
Leu Lys Ser Glu Phe Val Ile Leu Arg Asp Glu Lys Trp Gly Gly Asn
275 280 285

aaa acc tac aca gct tac gtg gac ctg gaa aag gac ttt gct gct gag 4333
Lys Thr Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu
290 295 300

gtt gta cat cct gga gac ctg aag aat tct gtt gaa gtc gca ctg aac 4381
Val Val His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn
305 310 315

aag ttg ctg gat cca atc cgg gaa aag ttt aat acc cct gcc ctg aaa 4429
Lys Leu Leu Asp Pro Ile Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys
320 325 330

aaa ctg gcc agc gct gcc tac cca gat ccc tca aag cag aag cca atg 4477
Lys Leu Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gln Lys Pro Met
335 340 345 350

gcc aaa ggc cct gcc aag aat tca gaa cca gag gag gtc atc ctc gag 4525
Ala Lys Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val Ile Leu Glu
355 360 365

cac cac cac cac cac cac tgagatccgg ctgctaacaa agcccgaaag 4573
His His His His His His
370

gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggcctct 4633

aaacgggtct tgaggggttt tttgctgaaa ggaggaacta tatccggat 4682



<210> 4 
<211> 372 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human mini 
TyrRS in pET20B 

<400> 4 

Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu Ile Thr Arg
1 5 10 15

Asn Leu Gln Glu Val Leu Gly Glu Glu Lys Leu Lys Glu Ile Leu Lys
20 25 30

Glu Arg Glu Leu Lys Ile Tyr Trp Gly Thr Ala Thr Thr Gly Lys Pro
35 40 45

His Val Ala Tyr Phe Val Pro Met Ser Lys Ile Ala Asp Phe Leu Lys
50 55 60

Ala Gly Cys Glu Val Thr Ile Leu Phe Ala Asp Leu His Ala Tyr Leu
65 70 75 80

Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Arg Val Ser Tyr
85 90 95

Tyr Glu Asn Val Ile Lys Ala Met Leu Glu Ser Ile Gly Val Pro Leu
100 105 110

Glu Lys Leu Lys Phe Ile Lys Gly Thr Asp Tyr Gln Leu Ser Lys Glu
115 120 125

Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gln His Asp
130 135 140

Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gln Val Glu His Pro Leu
145 150 155 160

Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gln Ala Leu Asp Glu Glu Tyr
165 170 175

Leu Lys Val Asp Ala Gln Phe Gly Gly Ile Asp Gln Arg Lys Ile Phe
180 185 190

Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys Arg Val
195 200 205

His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys Met Ser
210 215 220

Ser Ser Glu Glu Glu Ser Lys Ile Asp Leu Leu Asp Arg Lys Glu Asp
225 230 235 240

Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn Val Glu
245 250 255

Asn Asn Gly Val Leu Ser Phe Ile Lys His Val Leu Phe Pro Leu Lys
260 265 270

Ser Glu Phe Val Ile Leu Arg Asp Glu Lys Trp Gly Gly Asn Lys Thr
275 280 285

Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu Val Val
290 295 300

His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn Lys Leu
305 310 315 320

Leu Asp Pro Ile Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys Lys Leu
325 330 335

Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gln Lys Pro Met Ala Lys
340 345 350

Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val Ile Leu Glu His His
355 360 365 

His His His His 
370 



<210> 5 
<211> 4100 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428)..(3961)
<220> 

<223> Description of Artificial Sequence: human TyrRS 
carboxyl-terminal domain in pET20B 

<400> 5 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340

taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 

gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 2460

tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 

cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 

gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 

gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 

catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760

tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 

ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 

tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 

ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 

aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 

gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 

tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 

acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 

cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 

cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 

tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 

tatacat atg cca gag gag gtc atc cca tcc cgg ctg gat atc cgt gtg 3469
Met Pro Glu Glu Val Ile Pro Ser Arg Leu Asp Ile Arg Val
1 5 10

ggg aaa atc atc act gtg gag aag cac cca gat gca gac agc ctg tat 3517
Gly Lys Ile Ile Thr Val Glu Lys His Pro Asp Ala Asp Ser Leu Tyr
15 20 25 30

gta gag aag att gac gtg ggg gaa gct gaa cca cgg act gtg gtg agc 3565
Val Glu Lys Ile Asp Val Gly Glu Ala Glu Pro Arg Thr Val Val Ser
35 40 45

ggc ctg gta cag ttc gtg ccc aag gag gaa ctg cag gac agg ctg gta 3613
Gly Leu Val Gln Phe Val Pro Lys Glu Glu Leu Gln Asp Arg Leu Val
50 55 60

gtg gtg ctg tgc aac ctg aaa ccc cag aag atg aga gga gtc gag tcc 3661
Val Val Leu Cys Asn Leu Lys Pro Gln Lys Met Arg Gly Val Glu Ser
65 70 75

caa ggc atg ctt ctg tgt gct tct ata gaa ggg ata aac cgc cag gtt 3709
Gln Gly Met Leu Leu Cys Ala Ser Ile Glu Gly Ile Asn Arg Gln Val
80 85 90

gaa cct ctg gac cct ccg gca ggc tct gct cct ggt gag cac gtg ttt 3757
Glu Pro Leu Asp Pro Pro Ala Gly Ser Ala Pro Gly Glu His Val Phe
95 100 105 110

gtg aag ggc tat gaa aag ggc caa cca gat gag gag ctc aag ccc aag 3805
Val Lys Gly Tyr Glu Lys Gly Gln Pro Asp Glu Glu Leu Lys Pro Lys
115 120 125

aag aaa gtc ttc gag aag ttg cag gct gac ttc aaa att tct gag gag 3853
Lys Lys Val Phe Glu Lys Leu Gln Ala Asp Phe Lys Ile Ser Glu Glu
130 135 140

tgc atc gca cag tgg aag caa acc aac ttc atg acc aag ctg ggc tcc 3901
Cys Ile Ala Gln Trp Lys Gln Thr Asn Phe Met Thr Lys Leu Gly Ser
145 150 155

att tcc tgt aaa tcg ctg aaa ggg ggg aac att agc ctc gag cac cac 3949
Ile Ser Cys Lys Ser Leu Lys Gly Gly Asn Ile Ser Leu Glu His His
160 165 170

cac cac cac cac tgagatccgg ctgctaacaa agcccgaaag gaagctgagt 4001
His His His His
175

tggctgctgc caccgctgag caataactag cataacccct tggggcctct aaacgggtct 4061
tgaggggttt tttgctgaaa ggaggaacta tatccggat 4100

<210> 6 
<211> 178 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human TyrRS 
carboxyl-terminal domain in pET20B 

<400> 6 

Met Pro Glu Glu Val Ile Pro Ser Arg Leu Asp Ile Arg Val Gly Lys
1 5 10 15

Ile Ile Thr Val Glu Lys His Pro Asp Ala Asp Ser Leu Tyr Val Glu
20 25 30

Lys Ile Asp Val Gly Glu Ala Glu Pro Arg Thr Val Val Ser Gly Leu
35 40 45

Val Gln Phe Val Pro Lys Glu Glu Leu Gln Asp Arg Leu Val Val Val
50 55 60

Leu Cys Asn Leu Lys Pro Gln Lys Met Arg Gly Val Glu Ser Gln Gly
65 70 75 80

Met Leu Leu Cys Ala Ser Ile Glu Gly Ile Asn Arg Gln Val Glu Pro
85 90 95

Leu Asp Pro Pro Ala Gly Ser Ala Pro Gly Glu His Val Phe Val Lys
100 105 110

Gly Tyr Glu Lys Gly Gln Pro Asp Glu Glu Leu Lys Pro Lys Lys Lys
115 120 125

Val Phe Glu Lys Leu Gln Ala Asp Phe Lys Ile Ser Glu Glu Cys Ile
130 135 140

Ala Gln Trp Lys Gln Thr Asn Phe Met Thr Lys Leu Gly Ser Ile Ser
145 150 155 160

Cys Lys Ser Leu Lys Gly Gly Asn Ile Ser Leu Glu His His His His
165 170 175

His His



<210> 7 
<211> 4682 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428)..(4543)
<220> 

<223> Description of Artificial Sequence: human mini 
TyrRS mutant in pET20B 

<400> 7 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 2460 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 
cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 
gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 
gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 
catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760 
tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 
ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 
tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 
ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 
aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 
gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 
tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 
acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 
cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 
cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 
tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 
tatacat atg ggg gac gct ccc agc cct gaa gag aaa ctg cac ctt atc 3469 
Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu Ile 
1 5 10 

acc cgg aac ctg cag gag gtt ctg ggg gaa gag aag ctg aag gag ata 3517 
Thr Arg Asn Leu Gln Glu Val Leu Gly Glu Glu Lys Leu Lys Glu Ile 
15 20 25 30 

ctg aag gag cgg gaa ctt aaa att tac tgg gga acg gca acc acg ggc 3565 
Leu Lys Glu Arg Glu Leu Lys Ile Tyr Trp Gly Thr Ala Thr Thr Gly 
35 40 45 

aaa cca cat gtg gct tac ttt gtg ccc atg tca aag att gca gac ttc 3613 
Lys Pro His Val Ala Tyr Phe Val Pro Met Ser Lys Ile Ala Asp Phe 
50 55 60 

tta aag gca ggg tgt gag gta aca att ctg ttt gcg gac ctc cac gca 3661 
Leu Lys Ala Gly Cys Glu Val Thr Ile Leu Phe Ala Asp Leu His Ala 
65 70 75 

tac ctg gat aac atg aaa gcc cca tgg gaa ctt cta gaa ctg cag gtc 3709 
Tyr Leu Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Gln Val 
80 85 90 

agt tac tat gag aat gtg atc aaa gca atg ctg gag agc att ggt gtg 3757 
Ser Tyr Tyr Glu Asn Val Ile Lys Ala Met Leu Glu Ser Ile Gly Val 
95 100 105 110 

ccc ttg gag aag ctc aag ttc atc aaa ggc act gat tac cag ctc agc 3805 
Pro Leu Glu Lys Leu Lys Phe Ile Lys Gly Thr Asp Tyr Gln Leu Ser 
115 120 125 

aaa gag tac aca cta gat gtg tac aga ctc tcc tcc gtg gtc aca cag 3853 
Lys Glu Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gln 
130 135 140 

cac gat tcc aag aag gct gga gct gag gtg gta aag cag gtg gag cac 3901 
His Asp Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gln Val Glu His 
145 150 155 

cct ttg ctg agt ggc ctc tta tac ccc gga ctg cag gct ttg gat gaa 3949 
Pro Leu Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gln Ala Leu Asp Glu 
160 165 170 

gag tat tta aaa gta gat gcc caa ttt gga ggc att gat cag aga aag 3997 
Glu Tyr Leu Lys Val Asp Ala Gln Phe Gly Gly Ile Asp Gln Arg Lys 
175 180 185 190 

att ttc acc ttt gca gag aag tac ctc cct gca ctt ggc tat tca aaa 4045 
Ile Phe Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys 
195 200 205 

cgg gtc cat ctg atg aat cct atg gtt cca gga tta aca ggc agc aaa 4093 
Arg Val His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys 
210 215 220 

atg agc tct tca gaa gag gag tcc aag att gat ctc ctt gat cgg aag 4141 
Met Ser Ser Ser Glu Glu Glu Ser Lys Ile Asp Leu Leu Asp Arg Lys 
225 230 235 

gag gat gtg aag aaa aaa ctg aag aag gcc ttc tgt gag cca gga aat 4189 
Glu Asp Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn 
240 245 250 

gtg gag aac aat ggg gtt ctg tcc ttc atc aag cat gtc ctt ttt ccc 4237 
Val Glu Asn Asn Gly Val Leu Ser Phe Ile Lys His Val Leu Phe Pro 
255 260 265 270 

ctt aag tcc gag ttt gtg atc cta cga gat gag aaa tgg ggt gga aac 4285 
Leu Lys Ser Glu Phe Val Ile Leu Arg Asp Glu Lys Trp Gly Gly Asn 
275 280 285 

aaa acc tac aca gct tac gtg gac ctg gaa aag gac ttt gct gct gag 4333 
Lys Thr Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu 
290 295 300 

gtt gta cat cct gga gac ctg aag aat tct gtt gaa gtc gca ctg aac 4381 
Val Val His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn 
305 310 315 

aag ttg ctg gat cca atc cgg gaa aag ttt aat acc cct gcc ctg aaa 4429 
Lys Leu Leu Asp Pro Ile Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys 
320 325 330 

aaa ctg gcc agc gct gcc tac cca gat ccc tca aag cag aag cca atg 4477 
Lys Leu Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gln Lys Pro Met 
335 340 345 350 

gcc aaa ggc cct gcc aag aat tca gaa cca gag gag gtc atc ctc gag 4525 
Ala Lys Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val Ile Leu Glu 
355 360 365 

cac cac cac cac cac cac tgagatccgg ctgctaacaa agcccgaaag 4573 
His His His His His His 
370 

gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggcctct 4633 
aaacgggtct tgaggggttt tttgctgaaa ggaggaacta tatccggat 4682 
<210> 8 
<211> 372 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human mini 
TyrRS mutant in pET20B 

<400> 8 

Met Gly Asp Ala Pro Ser Pro Glu Glu Lys Leu His Leu Ile Thr Arg 
1 5 10 15 

Asn Leu Gln Glu Val Leu Gly Glu Glu Lys Leu Lys Glu Ile Leu Lys 
20 25 30 

Glu Arg Glu Leu Lys Ile Tyr Trp Gly Thr Ala Thr Thr Gly Lys Pro 
35 40 45 

His Val Ala Tyr Phe Val Pro Met Ser Lys Ile Ala Asp Phe Leu Lys 
50 55 60 

Ala Gly Cys Glu Val Thr Ile Leu Phe Ala Asp Leu His Ala Tyr Leu 
65 70 75 80 

Asp Asn Met Lys Ala Pro Trp Glu Leu Leu Glu Leu Gln Val Ser Tyr 
85 90 95 

Tyr Glu Asn Val Ile Lys Ala Met Leu Glu Ser Ile Gly Val Pro Leu 
100 105 110 

Glu Lys Leu Lys Phe Ile Lys Gly Thr Asp Tyr Gln Leu Ser Lys Glu 
115 120 125 

Tyr Thr Leu Asp Val Tyr Arg Leu Ser Ser Val Val Thr Gln His Asp 
130 135 140 

Ser Lys Lys Ala Gly Ala Glu Val Val Lys Gln Val Glu His Pro Leu 
145 150 155 160 

Leu Ser Gly Leu Leu Tyr Pro Gly Leu Gln Ala Leu Asp Glu Glu Tyr 
165 170 175 

Leu Lys Val Asp Ala Gln Phe Gly Gly Ile Asp Gln Arg Lys Ile Phe 
180 185 190 

Thr Phe Ala Glu Lys Tyr Leu Pro Ala Leu Gly Tyr Ser Lys Arg Val 
195 200 205 

His Leu Met Asn Pro Met Val Pro Gly Leu Thr Gly Ser Lys Met Ser 
210 215 220 

Ser Ser Glu Glu Glu Ser Lys Ile Asp Leu Leu Asp Arg Lys Glu Asp 
225 230 235 240 

Val Lys Lys Lys Leu Lys Lys Ala Phe Cys Glu Pro Gly Asn Val Glu 
245 250 255 

Asn Asn Gly Val Leu Ser Phe Ile Lys His Val Leu Phe Pro Leu Lys 
260 265 270 

Ser Glu Phe Val Ile Leu Arg Asp Glu Lys Trp Gly Gly Asn Lys Thr 
275 280 285 

Tyr Thr Ala Tyr Val Asp Leu Glu Lys Asp Phe Ala Ala Glu Val Val 
290 295 300 

His Pro Gly Asp Leu Lys Asn Ser Val Glu Val Ala Leu Asn Lys Leu 
305 310 315 320 

Leu Asp Pro Ile Arg Glu Lys Phe Asn Thr Pro Ala Leu Lys Lys Leu 
325 330 335 

Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys Gln Lys Pro Met Ala Lys 
340 345 350 

Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu Val Ile Leu Glu His His 
355 360 365 

His His His His 
370 



<210> 9 
<211> 5018 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428)..(4879) 
<220> 

<223> Description of Artificial Sequence: human 
full-length TrpRS in pET20B 

<400> 9 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 2460 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 
cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 
gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 



gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 
catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760 
tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 
ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 
tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 
ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 
aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 
gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 
tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 
acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 
cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 
cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 
tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 
tatacat atg ccc aac agt gag ccc gca tct ctg ctg gag ctg ttc aac 3469 
Met Pro Asn Ser Glu Pro Ala Ser Leu Leu Glu Leu Phe Asn 
1 5 10 

agc atc gcc aca caa ggg gag ctc gta agg tcc ctc aaa gcg gga aat 3517 
Ser Ile Ala Thr Gln Gly Glu Leu Val Arg Ser Leu Lys Ala Gly Asn 
15 20 25 30 

gcg tca aag gat gaa att gat tct gca gta aag atg ttg gtg tca tta 3565 
Ala Ser Lys Asp Glu Ile Asp Ser Ala Val Lys Met Leu Val Ser Leu 
35 40 45 

aaa atg agc tac aaa gct gcc gcg ggg gag gat tac aag gct gac tgt 3613 
Lys Met Ser Tyr Lys Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp Cys 
50 55 60 

cct cca ggg aac cca gca cct acc agt aat cat ggc cca gat gcc aca 3661 
Pro Pro Gly Asn Pro Ala Pro Thr Ser Asn His Gly Pro Asp Ala Thr 
65 70 75 

gaa gct gaa gag gat ttt gtg gac cca tgg aca gta cag aca agc agt 3709 
Glu Ala Glu Glu Asp Phe Val Asp Pro Trp Thr Val Gln Thr Ser Ser 
80 85 90 

gca aaa ggc ata gac tac gat aag ctc att gtt cgg ttt gga agt agt 3757 
Ala Lys Gly Ile Asp Tyr Asp Lys Leu Ile Val Arg Phe Gly Ser Ser 
95 100 105 110 

aaa att gac aaa gag cta ata aac cga ata gag aga gcc acc ggc caa 3805 
Lys Ile Asp Lys Glu Leu Ile Asn Arg Ile Glu Arg Ala Thr Gly Gln 
115 120 125 

aga cca cac cac ttc ctg cgc aga ggc atc ttc ttc tca cac aga gat 3853 
Arg Pro His His Phe Leu Arg Arg Gly Ile Phe Phe Ser His Arg Asp 
130 135 140 

atg aat cag gtt ctt gat gcc tat gaa aat aag aag cca ttt tat ctg 3901 
Met Asn Gln Val Leu Asp Ala Tyr Glu Asn Lys Lys Pro Phe Tyr Leu 
145 150 155 

tac acg ggc cgg ggc ccc tct tct gaa gca atg cat gta ggt cac ctc 3949 
Tyr Thr Gly Arg Gly Pro Ser Ser Glu Ala Met His Val Gly His Leu 
160 165 170 

att cca ttt att ttc aca aag tgg ctc cag gat gta ttt aac gtg ccc 3997 
Ile Pro Phe Ile Phe Thr Lys Trp Leu Gln Asp Val Phe Asn Val Pro 
175 180 185 190 

ttg gtc atc cag atg acg gat gac gag aag tat ctg tgg aag gac ctg 4045 
Leu Val Ile Gln Met Thr Asp Asp Glu Lys Tyr Leu Trp Lys Asp Leu 
195 200 205 

acc ctg gac cag gcc tat ggc gat gct gtt gag aat gcc aag gac atc 4093 
Thr Leu Asp Gln Ala Tyr Gly Asp Ala Val Glu Asn Ala Lys Asp Ile 
210 215 220 

atc gcc tgt ggc ttt gac atc aac aag act ttc ata ttc tct gac ctg 4141 
Ile Ala Cys Gly Phe Asp Ile Asn Lys Thr Phe Ile Phe Ser Asp Leu 
225 230 235 

gac tac atg ggg atg agc tca ggt ttc tac aaa aat gtg gtg aag att 4189 
Asp Tyr Met Gly Met Ser Ser Gly Phe Tyr Lys Asn Val Val Lys Ile 
240 245 250 

caa aag cat gtt acc ttc aac caa gtg aaa ggc att ttc ggc ttc act 4237 
Gln Lys His Val Thr Phe Asn Gln Val Lys Gly Ile Phe Gly Phe Thr 
255 260 265 270 

gac agc gac tgc att ggg aag atc agt ttt cct gcc atc cag gct gct 4285 
Asp Ser Asp Cys Ile Gly Lys Ile Ser Phe Pro Ala Ile Gln Ala Ala 
275 280 285 

ccc tcc ttc agc aac tca ttc cca cag atc ttc cga gac agg acg gat 4333 
Pro Ser Phe Ser Asn Ser Phe Pro Gln Ile Phe Arg Asp Arg Thr Asp 
290 295 300 

atc cag tgc ctt atc cca tgt gcc att gac cag gat cct tac ttt aga 4381 
Ile Gln Cys Leu Ile Pro Cys Ala Ile Asp Gln Asp Pro Tyr Phe Arg 
305 310 315 

atg aca agg gac gtc gcc ccc agg atc ggc tat cct aaa cca gcc ctg 4429 
Met Thr Arg Asp Val Ala Pro Arg Ile Gly Tyr Pro Lys Pro Ala Leu 
320 325 330 

ttg cac tcc acc ttc ttc cca gcc ctg cag ggc gcc cag acc aaa atg 4477 
Leu His Ser Thr Phe Phe Pro Ala Leu Gln Gly Ala Gln Thr Lys Met 
335 340 345 350 

agt gcc agc gac cca aac tcc tcc atc ttc ctc acc gac acg gcc aag 4525 
Ser Ala Ser Asp Pro Asn Ser Ser Ile Phe Leu Thr Asp Thr Ala Lys 
355 360 365 

cag atc aaa acc aag gtc aat aag cat gcg ttt tct gga ggg aga gac 4573 
Gln Ile Lys Thr Lys Val Asn Lys His Ala Phe Ser Gly Gly Arg Asp 
370 375 380 

acc atc gag gag cac agg cag ttt ggg ggc aac tgt gat gtg gac gtg 4621 
Thr Ile Glu Glu His Arg Gln Phe Gly Gly Asn Cys Asp Val Asp Val 
385 390 395 

tct ttc atg tac ctg acc ttc ttc ctc gag gac gac gac aag ctc gag 4669 
Ser Phe Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp Asp Lys Leu Glu 
400 405 410 

cag atc agg aag gat tac acc agc gga gcc atg ctc acc ggt gag ctc 4717 
Gln Ile Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu Thr Gly Glu Leu 
415 420 425 430 

aag aag gca ctc ata gag gtt ctg cag ccc ttg atc gca gag cac cag 4765 
Lys Lys Ala Leu Ile Glu Val Leu Gln Pro Leu Ile Ala Glu His Gln 
435 440 445 

gcc cgg cgc aag gag gtc acg gat gag ata gtg aaa gag ttc atg act 4813 
Ala Arg Arg Lys Glu Val Thr Asp Glu Ile Val Lys Glu Phe Met Thr 
450 455 460 

ccc cgg aag ctg tcc ttc gac ttt cag aag ctt gcg gcc gca ctc gag 4861 
Pro Arg Lys Leu Ser Phe Asp Phe Gln Lys Leu Ala Ala Ala Leu Glu 
465 470 475 

cac cac cac cac cac cac tgagatccgg ctgctaacaa agcccgaaag 4909 
His His His His His His 
480 

gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggcctct 4969 
aaacgggtct tgaggggttt tttgctgaaa ggaggaacta tatccggat 5018 



<210> 10 
<211> 484 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human 
full-length TrpRS in pET20B 

<400> 10 

Met Pro Asn Ser Glu Pro Ala Ser Leu Leu Glu Leu Phe Asn Ser Ile 
1 5 10 15 

Ala Thr Gln Gly Glu Leu Val Arg Ser Leu Lys Ala Gly Asn Ala Ser 
20 25 30 

Lys Asp Glu Ile Asp Ser Ala Val Lys Met Leu Val Ser Leu Lys Met 
35 40 45 

Ser Tyr Lys Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp Cys Pro Pro 
50 55 60 

Gly Asn Pro Ala Pro Thr Ser Asn His Gly Pro Asp Ala Thr Glu Ala 
65 70 75 80 

Glu Glu Asp Phe Val Asp Pro Trp Thr Val Gln Thr Ser Ser Ala Lys 
85 90 95 

Gly Ile Asp Tyr Asp Lys Leu Ile Val Arg Phe Gly Ser Ser Lys Ile 
100 105 110 

Asp Lys Glu Leu Ile Asn Arg Ile Glu Arg Ala Thr Gly Gln Arg Pro 
115 120 125 

His His Phe Leu Arg Arg Gly Ile Phe Phe Ser His Arg Asp Met Asn 
130 135 140 

Gln Val Leu Asp Ala Tyr Glu Asn Lys Lys Pro Phe Tyr Leu Tyr Thr 
145 150 155 160 

Gly Arg Gly Pro Ser Ser Glu Ala Met His Val Gly His Leu Ile Pro 
165 170 175 

Phe Ile Phe Thr Lys Trp Leu Gln Asp Val Phe Asn Val Pro Leu Val 
180 185 190 

Ile Gln Met Thr Asp Asp Glu Lys Tyr Leu Trp Lys Asp Leu Thr Leu 
195 200 205 

Asp Gln Ala Tyr Gly Asp Ala Val Glu Asn Ala Lys Asp Ile Ile Ala 
210 215 220 

Cys Gly Phe Asp Ile Asn Lys Thr Phe Ile Phe Ser Asp Leu Asp Tyr 
225 230 235 240 

Met Gly Met Ser Ser Gly Phe Tyr Lys Asn Val Val Lys Ile Gln Lys 
245 250 255 

His Val Thr Phe Asn Gln Val Lys Gly Ile Phe Gly Phe Thr Asp Ser 
260 265 270 

Asp Cys Ile Gly Lys Ile Ser Phe Pro Ala Ile Gln Ala Ala Pro Ser 
275 280 285 

Phe Ser Asn Ser Phe Pro Gln Ile Phe Arg Asp Arg Thr Asp Ile Gln 
290 295 300 

Cys Leu Ile Pro Cys Ala Ile Asp Gln Asp Pro Tyr Phe Arg Met Thr 
305 310 315 320 

Arg Asp Val Ala Pro Arg Ile Gly Tyr Pro Lys Pro Ala Leu Leu His 
325 330 335 

Ser Thr Phe Phe Pro Ala Leu Gln Gly Ala Gln Thr Lys Met Ser Ala 
340 345 350 

Ser Asp Pro Asn Ser Ser Ile Phe Leu Thr Asp Thr Ala Lys Gln Ile 
355 360 365 

Lys Thr Lys Val Asn Lys His Ala Phe Ser Gly Gly Arg Asp Thr Ile 
370 375 380 

Glu Glu His Arg Gln Phe Gly Gly Asn Cys Asp Val Asp Val Ser Phe 
385 390 395 400 

Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp Asp Lys Leu Glu Gln Ile 
405 410 415 

Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu Thr Gly Glu Leu Lys Lys 
420 425 430 

Ala Leu Ile Glu Val Leu Gln Pro Leu Ile Ala Glu His Gln Ala Arg 
435 440 445 

Arg Lys Glu Val Thr Asp Glu Ile Val Lys Glu Phe Met Thr Pro Arg 
450 455 460 

Lys Leu Ser Phe Asp Phe Gln Lys Leu Ala Ala Ala Leu Glu His His 
465 470 475 480 

His His His His 

<210> 11 
<211> 4877 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428)..(4738) 
<220> 

<223> Description of Artificial Sequence: human mini 
TrpRS in pET20B 

<400> 11 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200 
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260 
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 2460 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 
cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 
gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 
gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 
catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760 
tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 
ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 
tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 
ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 
aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 
gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 
tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 
acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 
cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 
cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 
tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 
tatacat atg agc tac aaa gct gcc gcg ggg gag gat tac aag gct gac 3469 
Met Ser Tyr Lys Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp 
1 5 10 

tgt cct cca ggg aac cca gca cct acc agt aat cat ggc cca gat gcc 3517 
Cys Pro Pro Gly Asn Pro Ala Pro Thr Ser Asn His Gly Pro Asp Ala 
15 20 25 30 

aca gaa gct gaa gag gat ttt gtg gac cca tgg aca gta cag aca agc 3565 
Thr Glu Ala Glu Glu Asp Phe Val Asp Pro Trp Thr Val Gln Thr Ser 
35 40 45 

agt gca aaa ggc ata gac tac gat aag ctc att gtt cgg ttt gga agt 3613 
Ser Ala Lys Gly Ile Asp Tyr Asp Lys Leu Ile Val Arg Phe Gly Ser 
50 55 60 

agt aaa att gac aaa gag cta ata aac cga ata gag aga gcc acc ggc 3661 
Ser Lys Ile Asp Lys Glu Leu Ile Asn Arg Ile Glu Arg Ala Thr Gly 
65 70 75 

caa aga cca cac cac ttc ctg cgc aga ggc atc ttc ttc tca cac aga 3709 
Gln Arg Pro His His Phe Leu Arg Arg Gly Ile Phe Phe Ser His Arg 
80 85 90 

gat atg aat cag gtt ctt gat gcc tat gaa aat aag aag cca ttt tat 3757 
Asp Met Asn Gln Val Leu Asp Ala Tyr Glu Asn Lys Lys Pro Phe Tyr 
95 100 105 110 

ctg tac acg ggc cgg ggc ccc tct tct gaa gca atg cat gta ggt cac 3805 
Leu Tyr Thr Gly Arg Gly Pro Ser Ser Glu Ala Met His Val Gly His 
115 120 125 

ctc att cca ttt att ttc aca aag tgg ctc cag gat gta ttt aac gtg 3853 
Leu Ile Pro Phe Ile Phe Thr Lys Trp Leu Gln Asp Val Phe Asn Val 
130 135 140 

ccc ttg gtc atc cag atg acg gat gac gag aag tat ctg tgg aag gac 3901 
Pro Leu Val Ile Gln Met Thr Asp Asp Glu Lys Tyr Leu Trp Lys Asp 
145 150 155 

ctg acc ctg gac cag gcc tat ggc gat gct gtt gag aat gcc aag gac 3949 
Leu Thr Leu Asp Gln Ala Tyr Gly Asp Ala Val Glu Asn Ala Lys Asp 
160 165 170 

atc atc gcc tgt ggc ttt gac atc aac aag act ttc ata ttc tct gac 3997 
Ile Ile Ala Cys Gly Phe Asp Ile Asn Lys Thr Phe Ile Phe Ser Asp 
175 180 185 190 

ctg gac tac atg ggg atg agc tca ggt ttc tac aaa aat gtg gtg aag 4045 
Leu Asp Tyr Met Gly Met Ser Ser Gly Phe Tyr Lys Asn Val Val Lys 
195 200 205 

att caa aag cat gtt acc ttc aac caa gtg aaa ggc att ttc ggc ttc 4093 
Ile Gln Lys His Val Thr Phe Asn Gln Val Lys Gly Ile Phe Gly Phe 
210 215 220 

act gac agc gac tgc att ggg aag atc agt ttt cct gcc atc cag gct 4141 
Thr Asp Ser Asp Cys Ile Gly Lys Ile Ser Phe Pro Ala Ile Gln Ala 
225 230 235 

gct ccc tcc ttc agc aac tca ttc cca cag atc ttc cga gac agg acg 4189 
Ala Pro Ser Phe Ser Asn Ser Phe Pro Gln Ile Phe Arg Asp Arg Thr 
240 245 250 

gat atc cag tgc ctt atc cca tgt gcc att gac cag gat cct tac ttt 4237 
Asp Ile Gln Cys Leu Ile Pro Cys Ala Ile Asp Gln Asp Pro Tyr Phe 
255 260 265 270 

aga atg aca agg gac gtc gcc ccc agg atc ggc tat cct aaa cca gcc 4285 
Arg Met Thr Arg Asp Val Ala Pro Arg Ile Gly Tyr Pro Lys Pro Ala 
275 280 285 

ctg ttg cac tcc acc ttc ttc cca gcc ctg cag ggc gcc cag acc aaa 4333 
Leu Leu His Ser Thr Phe Phe Pro Ala Leu Gln Gly Ala Gln Thr Lys 
290 295 300 

atg agt gcc agc gac cca aac tcc tcc atc ttc ctc acc gac acg gcc 4381 
Met Ser Ala Ser Asp Pro Asn Ser Ser Ile Phe Leu Thr Asp Thr Ala 
305 310 315 

aag cag atc aaa acc aag gtc aat aag cat gcg ttt tct gga ggg aga 4429 
Lys Gln Ile Lys Thr Lys Val Asn Lys His Ala Phe Ser Gly Gly Arg 
320 325 330 

gac acc atc gag gag cac agg cag ttt ggg ggc aac tgt gat gtg gac 4477 
Asp Thr Ile Glu Glu His Arg Gln Phe Gly Gly Asn Cys Asp Val Asp 
335 340 345 350 

gtg tct ttc atg tac ctg acc ttc ttc ctc gag gac gac gac aag ctc 4525 
Val Ser Phe Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp Asp Lys Leu 
355 360 365 

gag cag atc agg aag gat tac acc agc gga gcc atg ctc acc ggt gag 4573 
Glu Gln Ile Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu Thr Gly Glu 
370 375 380 

ctc aag aag gca ctc ata gag gtt ctg cag ccc ttg atc gca gag cac 4621 
Leu Lys Lys Ala Leu Ile Glu Val Leu Gln Pro Leu Ile Ala Glu His 
385 390 395 

cag gcc cgg cgc aag gag gtc acg gat gag ata gtg aaa gag ttc atg 4669 
Gln Ala Arg Arg Lys Glu Val Thr Asp Glu Ile Val Lys Glu Phe Met 
400 405 410 

act ccc cgg aag ctg tcc ttc gac ttt cag aag ctt gcg gcc gca ctc 4717 
Thr Pro Arg Lys Leu Ser Phe Asp Phe Gln Lys Leu Ala Ala Ala Leu 
415 420 425 430 

gag cac cac cac cac cac cac tgagatccgg ctgctaacaa agcccgaaag 4768 
Glu His His His His His His 
435 

gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggcctct 4828 
aaacgggtct tgaggggttt tttgctgaaa ggaggaacta tatccggat 4877 



<210> 12 
<211> 437 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human mini 
TrpRS in pET20B 

<400> 12 

Met Ser Tyr Lys Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp Cys Pro 
1 5 10 15 

Pro Gly Asn Pro Ala Pro Thr Ser Asn His Gly Pro Asp Ala Thr Glu 
20 25 30 

Ala Glu Glu Asp Phe Val Asp Pro Trp Thr Val Gln Thr Ser Ser Ala 
35 40 45 

Lys Gly Ile Asp Tyr Asp Lys Leu Ile Val Arg Phe Gly Ser Ser Lys 
50 55 60 

Ile Asp Lys Glu Leu Ile Asn Arg Ile Glu Arg Ala Thr Gly Gln Arg 
65 70 75 80 

Pro His His Phe Leu Arg Arg Gly Ile Phe Phe Ser His Arg Asp Met 
85 90 95 

Asn Gln Val Leu Asp Ala Tyr Glu Asn Lys Lys Pro Phe Tyr Leu Tyr 
100 105 110 

Thr Gly Arg Gly Pro Ser Ser Glu Ala Met His Val Gly His Leu Ile 
115 120 125 

Pro Phe Ile Phe Thr Lys Trp Leu Gln Asp Val Phe Asn Val Pro Leu 
130 135 140 

Val Ile Gln Met Thr Asp Asp Glu Lys Tyr Leu Trp Lys Asp Leu Thr 
145 150 155 160 

Leu Asp Gln Ala Tyr Gly Asp Ala Val Glu Asn Ala Lys Asp Ile Ile 
165 170 175 

Ala Cys Gly Phe Asp Ile Asn Lys Thr Phe Ile Phe Ser Asp Leu Asp 
180 185 190 

Tyr Met Gly Met Ser Ser Gly Phe Tyr Lys Asn Val Val Lys Ile Gln 
195 200 205 

Lys His Val Thr Phe Asn Gln Val Lys Gly Ile Phe Gly Phe Thr Asp 
210 215 220 

Ser Asp Cys Ile Gly Lys Ile Ser Phe Pro Ala Ile Gln Ala Ala Pro 
225 230 235 240 

Ser Phe Ser Asn Ser Phe Pro Gln Ile Phe Arg Asp Arg Thr Asp Ile 
245 250 255 

Gln Cys Leu Ile Pro Cys Ala Ile Asp Gln Asp Pro Tyr Phe Arg Met 
260 265 270 

Thr Arg Asp Val Ala Pro Arg Ile Gly Tyr Pro Lys Pro Ala Leu Leu 
275 280 285 

His Ser Thr Phe Phe Pro Ala Leu Gln Gly Ala Gln Thr Lys Met Ser 
290 295 300 

Ala Ser Asp Pro Asn Ser Ser Ile Phe Leu Thr Asp Thr Ala Lys Gln 
305 310 315 320 

Ile Lys Thr Lys Val Asn Lys His Ala Phe Ser Gly Gly Arg Asp Thr 
325 330 335 

Ile Glu Glu His Arg Gln Phe Gly Gly Asn Cys Asp Val Asp Val Ser 
340 345 350 

Phe Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp Asp Lys Leu Glu Gln 
355 360 365 

Ile Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu Thr Gly Glu Leu Lys 
370 375 380 

Lys Ala Leu Ile Glu Val Leu Gln Pro Leu Ile Ala Glu His Gln Ala 
385 390 395 400 

Arg Arg Lys Glu Val Thr Asp Glu Ile Val Lys Glu Phe Met Thr Pro 
405 410 415 

Arg Lys Leu Ser Phe Asp Phe Gln Lys Leu Ala Ala Ala Leu Glu His 
420 425 430 

His His His His His 
435 



<210> 13 
<211> 4811 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428)..(4672)
<220> 

<223> Description of Artificial Sequence: human 
supermini TrpRS in pET20B 

<400> 13 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 



cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980

gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 

tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 

cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 

cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 

cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 

ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 

taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400

gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 2460

tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 

cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 

gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 

gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 

catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760 

tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 

ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 

tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 

ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 

aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060 

gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120 

tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180 

acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240 

cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300 

cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360 

tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420 

tatacat atg agt aat cat ggc cca gat gcc aca gaa gct gaa gag gat 3469
Met Ser Asn His Gly Pro Asp Ala Thr Glu Ala Glu Glu Asp
1 5 10

ttt gtg gac cca tgg aca gta cag aca agc agt gca aaa ggc ata gac 3517
Phe Val Asp Pro Trp Thr Val Gln Thr Ser Ser Ala Lys Gly Ile Asp
15 20 25 30

tac gat aag ctc att gtt cgg ttt gga agt agt aaa att gac aaa gag 3565
Tyr Asp Lys Leu Ile Val Arg Phe Gly Ser Ser Lys Ile Asp Lys Glu
35 40 45

cta ata aac cga ata gag aga gcc acc ggc caa aga cca cac cac ttc 3613
Leu Ile Asn Arg Ile Glu Arg Ala Thr Gly Gln Arg Pro His His Phe
50 55 60

ctg cgc aga ggc atc ttc ttc tca cac aga gat atg aat cag gtt ctt 3661
Leu Arg Arg Gly Ile Phe Phe Ser His Arg Asp Met Asn Gln Val Leu
65 70 75

gat gcc tat gaa aat aag aag cca ttt tat ctg tac acg ggc cgg ggc 3709
Asp Ala Tyr Glu Asn Lys Lys Pro Phe Tyr Leu Tyr Thr Gly Arg Gly
80 85 90

ccc tct tct gaa gca atg cat gta ggt cac ctc att cca ttt att ttc 3757
Pro Ser Ser Glu Ala Met His Val Gly His Leu Ile Pro Phe Ile Phe
95 100 105 110

aca aag tgg ctc cag gat gta ttt aac gtg ccc ttg gtc atc cag atg 3805
Thr Lys Trp Leu Gln Asp Val Phe Asn Val Pro Leu Val Ile Gln Met
115 120 125

acg gat gac gag aag tat ctg tgg aag gac ctg acc ctg gac cag gcc 3853
Thr Asp Asp Glu Lys Tyr Leu Trp Lys Asp Leu Thr Leu Asp Gln Ala
130 135 140

tat ggc gat gct gtt gag aat gcc aag gac atc atc gcc tgt ggc ttt 3901
Tyr Gly Asp Ala Val Glu Asn Ala Lys Asp Ile Ile Ala Cys Gly Phe
145 150 155

gac atc aac aag act ttc ata ttc tct gac ctg gac tac atg ggg atg 3949
Asp Ile Asn Lys Thr Phe Ile Phe Ser Asp Leu Asp Tyr Met Gly Met
160 165 170

agc tca ggt ttc tac aaa aat gtg gtg aag att caa aag cat gtt acc 3997
Ser Ser Gly Phe Tyr Lys Asn Val Val Lys Ile Gln Lys His Val Thr
175 180 185 190

ttc aac caa gtg aaa ggc att ttc ggc ttc act gac agc gac tgc att 4045
Phe Asn Gln Val Lys Gly Ile Phe Gly Phe Thr Asp Ser Asp Cys Ile
195 200 205

ggg aag atc agt ttt cct gcc atc cag gct gct ccc tcc ttc agc aac 4093
Gly Lys Ile Ser Phe Pro Ala Ile Gln Ala Ala Pro Ser Phe Ser Asn
210 215 220

tca ttc cca cag atc ttc cga gac agg acg gat atc cag tgc ctt atc 4141
Ser Phe Pro Gln Ile Phe Arg Asp Arg Thr Asp Ile Gln Cys Leu Ile
225 230 235

cca tgt gcc att gac cag gat cct tac ttt aga atg aca agg gac gtc 4189
Pro Cys Ala Ile Asp Gln Asp Pro Tyr Phe Arg Met Thr Arg Asp Val
240 245 250

gcc ccc agg atc ggc tat cct aaa cca gcc ctg ttg cac tcc acc ttc 4237
Ala Pro Arg Ile Gly Tyr Pro Lys Pro Ala Leu Leu His Ser Thr Phe
255 260 265 270

ttc cca gcc ctg cag ggc gcc cag acc aaa atg agt gcc agc gac cca 4285
Phe Pro Ala Leu Gln Gly Ala Gln Thr Lys Met Ser Ala Ser Asp Pro
275 280 285

aac tcc tcc atc ttc ctc acc gac acg gcc aag cag atc aaa acc aag 4333
Asn Ser Ser Ile Phe Leu Thr Asp Thr Ala Lys Gln Ile Lys Thr Lys
290 295 300

gtc aat aag cat gcg ttt tct gga ggg aga gac acc atc gag gag cac 4381
Val Asn Lys His Ala Phe Ser Gly Gly Arg Asp Thr Ile Glu Glu His
305 310 315

agg cag ttt ggg ggc aac tgt gat gtg gac gtg tct ttc atg tac ctg 4429
Arg Gln Phe Gly Gly Asn Cys Asp Val Asp Val Ser Phe Met Tyr Leu
320 325 330

acc ttc ttc ctc gag gac gac gac aag ctc gag cag atc agg aag gat 4477
Thr Phe Phe Leu Glu Asp Asp Asp Lys Leu Glu Gln Ile Arg Lys Asp
335 340 345 350

tac acc agc gga gcc atg ctc acc ggt gag ctc aag aag gca ctc ata 4525
Tyr Thr Ser Gly Ala Met Leu Thr Gly Glu Leu Lys Lys Ala Leu Ile
355 360 365

gag gtt ctg cag ccc ttg atc gca gag cac cag gcc cgg cgc aag gag 4573
Glu Val Leu Gln Pro Leu Ile Ala Glu His Gln Ala Arg Arg Lys Glu
370 375 380

gtc acg gat gag ata gtg aaa gag ttc atg act ccc cgg aag ctg tcc 4621
Val Thr Asp Glu Ile Val Lys Glu Phe Met Thr Pro Arg Lys Leu Ser
385 390 395

ttc gac ttt cag aag ctt gcg gcc gca ctc gag cac cac cac cac cac 4669
Phe Asp Phe Gln Lys Leu Ala Ala Ala Leu Glu His His His His His
400 405 410

cac tgagatccgg ctgctaacaa agcccgaaag gaagctgagt tggctgctgc 4722
His
415

caccgctgag caataactag cataacccct tggggcctct aaacgggtct tgaggggttt 4782
tttgctgaaa ggaggaacta tatccggat 4811



<210> 14 
<211> 415 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human 
supermini TrpRS in pET20B 

<400> 14 

Met Ser Asn His Gly Pro Asp Ala Thr Glu Ala Glu Glu Asp Phe Val 
1 5 10 15 

Asp Pro Trp Thr Val Gln Thr Ser Ser Ala Lys Gly Ile Asp Tyr Asp
20 25 30

Lys Leu Ile Val Arg Phe Gly Ser Ser Lys Ile Asp Lys Glu Leu Ile
35 40 45

Asn Arg Ile Glu Arg Ala Thr Gly Gln Arg Pro His His Phe Leu Arg
50 55 60

Arg Gly Ile Phe Phe Ser His Arg Asp Met Asn Gln Val Leu Asp Ala
65 70 75 80

Tyr Glu Asn Lys Lys Pro Phe Tyr Leu Tyr Thr Gly Arg Gly Pro Ser
85 90 95

Ser Glu Ala Met His Val Gly His Leu Ile Pro Phe Ile Phe Thr Lys
100 105 110

Trp Leu Gln Asp Val Phe Asn Val Pro Leu Val Ile Gln Met Thr Asp
115 120 125

Asp Glu Lys Tyr Leu Trp Lys Asp Leu Thr Leu Asp Gln Ala Tyr Gly
130 135 140

Asp Ala Val Glu Asn Ala Lys Asp Ile Ile Ala Cys Gly Phe Asp Ile
145 150 155 160

Asn Lys Thr Phe Ile Phe Ser Asp Leu Asp Tyr Met Gly Met Ser Ser
165 170 175

Gly Phe Tyr Lys Asn Val Val Lys Ile Gln Lys His Val Thr Phe Asn
180 185 190

Gln Val Lys Gly Ile Phe Gly Phe Thr Asp Ser Asp Cys Ile Gly Lys
195 200 205

Ile Ser Phe Pro Ala Ile Gln Ala Ala Pro Ser Phe Ser Asn Ser Phe
210 215 220

Pro Gln Ile Phe Arg Asp Arg Thr Asp Ile Gln Cys Leu Ile Pro Cys
225 230 235 240

Ala Ile Asp Gln Asp Pro Tyr Phe Arg Met Thr Arg Asp Val Ala Pro
245 250 255

Arg Ile Gly Tyr Pro Lys Pro Ala Leu Leu His Ser Thr Phe Phe Pro
260 265 270

Ala Leu Gln Gly Ala Gln Thr Lys Met Ser Ala Ser Asp Pro Asn Ser
275 280 285

Ser Ile Phe Leu Thr Asp Thr Ala Lys Gln Ile Lys Thr Lys Val Asn
290 295 300

Lys His Ala Phe Ser Gly Gly Arg Asp Thr Ile Glu Glu His Arg Gln
305 310 315 320

Phe Gly Gly Asn Cys Asp Val Asp Val Ser Phe Met Tyr Leu Thr Phe
325 330 335

Phe Leu Glu Asp Asp Asp Lys Leu Glu Gln Ile Arg Lys Asp Tyr Thr
340 345 350

Ser Gly Ala Met Leu Thr Gly Glu Leu Lys Lys Ala Leu Ile Glu Val
355 360 365

Leu Gln Pro Leu Ile Ala Glu His Gln Ala Arg Arg Lys Glu Val Thr
370 375 380

Asp Glu Ile Val Lys Glu Phe Met Thr Pro Arg Lys Leu Ser Phe Asp
385 390 395 400

Phe Gln Lys Leu Ala Ala Ala Leu Glu His His His His His His
405 410 415

<210> 15 
<211> 4742 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (3428)..(4603)
<220> 

<223> Description of Artificial Sequence: human minor 
Trp-RS fragment in pET20B 



<400> 15 
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 600
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 660
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 720
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 780
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 840
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 900
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 960
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 1020
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 1080
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 1140
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 1200
ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc 1260
ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg 1320 
cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 1380 
gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 1440 
actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt 1500 
aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac 1560 
caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 1620 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc 1680 
accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 1740 
aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg 1800 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 1860 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 1920 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga 1980 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 2040 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 2100 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 2160 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 2220 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 2280 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 2340 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 2400 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg 2460 
tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat 2520 
cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct 2580 
gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct 2640 
gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct 2700 
catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt 2760 
tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg 2820 
ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa 2880 
tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 2940 
ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 3000 
aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 3060
gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 3120
tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag 3180
acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 3240
cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 3300
cccgtggcca ggacccaacg ctgcccgaga tctcgatccc gcgaaattaa tacgactcac 3360
tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 3420

tatacat atg agt gca aaa ggc ata gac tac gat aag ctc att gtt cgg 3469
Met Ser Ala Lys Gly Ile Asp Tyr Asp Lys Leu Ile Val Arg
1 5 10

ttt gga agt agt aaa att gac aaa gag cta ata aac cga ata gag aga 3517
Phe Gly Ser Ser Lys Ile Asp Lys Glu Leu Ile Asn Arg Ile Glu Arg
15 20 25 30

gcc acc ggc caa aga cca cac cac ttc ctg cgc aga ggc atc ttc ttc 3565
Ala Thr Gly Gln Arg Pro His His Phe Leu Arg Arg Gly Ile Phe Phe
35 40 45

tca cac aga gat atg aat cag gtt ctt gat gcc tat gaa aat aag aag 3613
Ser His Arg Asp Met Asn Gln Val Leu Asp Ala Tyr Glu Asn Lys Lys
50 55 60

cca ttt tat ctg tac acg ggc cgg ggc ccc tct tct gaa gca atg cat 3661
Pro Phe Tyr Leu Tyr Thr Gly Arg Gly Pro Ser Ser Glu Ala Met His
65 70 75

gta ggt cac ctc att cca ttt att ttc aca aag tgg ctc cag gat gta 3709
Val Gly His Leu Ile Pro Phe Ile Phe Thr Lys Trp Leu Gln Asp Val
80 85 90

ttt aac gtg ccc ttg gtc atc cag atg acg gat gac gag aag tat ctg 3757
Phe Asn Val Pro Leu Val Ile Gln Met Thr Asp Asp Glu Lys Tyr Leu
95 100 105 110

tgg aag gac ctg acc ctg gac cag gcc tat ggc gat gct gtt gag aat 3805
Trp Lys Asp Leu Thr Leu Asp Gln Ala Tyr Gly Asp Ala Val Glu Asn
115 120 125

gcc aag gac atc atc gcc tgt ggc ttt gac atc aac aag act ttc ata 3853
Ala Lys Asp Ile Ile Ala Cys Gly Phe Asp Ile Asn Lys Thr Phe Ile
130 135 140

ttc tct gac ctg gac tac atg ggg atg agc tca ggt ttc tac aaa aat 3901
Phe Ser Asp Leu Asp Tyr Met Gly Met Ser Ser Gly Phe Tyr Lys Asn
145 150 155

gtg gtg aag att caa aag cat gtt acc ttc aac caa gtg aaa ggc att 3949
Val Val Lys Ile Gln Lys His Val Thr Phe Asn Gln Val Lys Gly Ile
160 165 170

ttc ggc ttc act gac agc gac tgc att ggg aag atc agt ttt cct gcc 3997
Phe Gly Phe Thr Asp Ser Asp Cys Ile Gly Lys Ile Ser Phe Pro Ala
175 180 185 190

atc cag gct gct ccc tcc ttc agc aac tca ttc cca cag atc ttc cga 4045
Ile Gln Ala Ala Pro Ser Phe Ser Asn Ser Phe Pro Gln Ile Phe Arg
195 200 205

gac agg acg gat atc cag tgc ctt atc cca tgt gcc att gac cag gat 4093
Asp Arg Thr Asp Ile Gln Cys Leu Ile Pro Cys Ala Ile Asp Gln Asp
210 215 220

cct tac ttt aga atg aca agg gac gtc gcc ccc agg atc ggc tat cct 4141
Pro Tyr Phe Arg Met Thr Arg Asp Val Ala Pro Arg Ile Gly Tyr Pro
225 230 235

aaa cca gcc ctg ttg cac tcc acc ttc ttc cca gcc ctg cag ggc gcc 4189
Lys Pro Ala Leu Leu His Ser Thr Phe Phe Pro Ala Leu Gln Gly Ala
240 245 250

cag acc aaa atg agt gcc agc gac cca aac tcc tcc atc ttc ctc acc 4237
Gln Thr Lys Met Ser Ala Ser Asp Pro Asn Ser Ser Ile Phe Leu Thr
255 260 265 270

gac acg gcc aag cag atc aaa acc aag gtc aat aag cat gcg ttt tct 4285
Asp Thr Ala Lys Gln Ile Lys Thr Lys Val Asn Lys His Ala Phe Ser
275 280 285

gga ggg aga gac acc atc gag gag cac agg cag ttt ggg ggc aac tgt 4333
Gly Gly Arg Asp Thr Ile Glu Glu His Arg Gln Phe Gly Gly Asn Cys
290 295 300

gat gtg gac gtg tct ttc atg tac ctg acc ttc ttc ctc gag gac gac 4381
Asp Val Asp Val Ser Phe Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp
305 310 315

gac aag ctc gag cag atc agg aag gat tac acc agc gga gcc atg ctc 4429
Asp Lys Leu Glu Gln Ile Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu
320 325 330

acc ggt gag ctc aag aag gca ctc ata gag gtt ctg cag ccc ttg atc 4477
Thr Gly Glu Leu Lys Lys Ala Leu Ile Glu Val Leu Gln Pro Leu Ile
335 340 345 350

gca gag cac cag gcc cgg cgc aag gag gtc acg gat gag ata gtg aaa 4525
Ala Glu His Gln Ala Arg Arg Lys Glu Val Thr Asp Glu Ile Val Lys
355 360 365

gag ttc atg act ccc cgg aag ctg tcc ttc gac ttt cag aag ctt gcg 4573
Glu Phe Met Thr Pro Arg Lys Leu Ser Phe Asp Phe Gln Lys Leu Ala
370 375 380

gcc gca ctc gag cac cac cac cac cac cac tgagatccgg ctgctaacaa 4623
Ala Ala Leu Glu His His His His His His
385 390

agcccgaaag gaagctgagt tggctgctgc caccgctgag caataactag cataacccct 4683
tggggcctct aaacgggtct tgaggggttt tttgctgaaa ggaggaacta tatccggat 4742



<210> 16
<211> 392 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human minor 
TrpRS fragment in pET20B 

<400> 16 

Met Ser Ala Lys Gly Ile Asp Tyr Asp Lys Leu Ile Val Arg Phe Gly
1 5 10 15

Ser Ser Lys Ile Asp Lys Glu Leu Ile Asn Arg Ile Glu Arg Ala Thr
20 25 30

Gly Gln Arg Pro His His Phe Leu Arg Arg Gly Ile Phe Phe Ser His
35 40 45

Arg Asp Met Asn Gln Val Leu Asp Ala Tyr Glu Asn Lys Lys Pro Phe
50 55 60

Tyr Leu Tyr Thr Gly Arg Gly Pro Ser Ser Glu Ala Met His Val Gly
65 70 75 80

His Leu Ile Pro Phe Ile Phe Thr Lys Trp Leu Gln Asp Val Phe Asn
85 90 95

Val Pro Leu Val Ile Gln Met Thr Asp Asp Glu Lys Tyr Leu Trp Lys
100 105 110

Asp Leu Thr Leu Asp Gln Ala Tyr Gly Asp Ala Val Glu Asn Ala Lys
115 120 125

Asp Ile Ile Ala Cys Gly Phe Asp Ile Asn Lys Thr Phe Ile Phe Ser
130 135 140

Asp Leu Asp Tyr Met Gly Met Ser Ser Gly Phe Tyr Lys Asn Val Val
145 150 155 160

Lys Ile Gln Lys His Val Thr Phe Asn Gln Val Lys Gly Ile Phe Gly
165 170 175

Phe Thr Asp Ser Asp Cys Ile Gly Lys Ile Ser Phe Pro Ala Ile Gln
180 185 190

Ala Ala Pro Ser Phe Ser Asn Ser Phe Pro Gln Ile Phe Arg Asp Arg
195 200 205

Thr Asp Ile Gln Cys Leu Ile Pro Cys Ala Ile Asp Gln Asp Pro Tyr
210 215 220

Phe Arg Met Thr Arg Asp Val Ala Pro Arg Ile Gly Tyr Pro Lys Pro
225 230 235 240

Ala Leu Leu His Ser Thr Phe Phe Pro Ala Leu Gln Gly Ala Gln Thr
245 250 255

Lys Met Ser Ala Ser Asp Pro Asn Ser Ser Ile Phe Leu Thr Asp Thr
260 265 270



Ala Lys Gln Ile Lys Thr Lys Val Asn Lys His Ala Phe Ser Gly Gly
275 280 285

Arg Asp Thr Ile Glu Glu His Arg Gln Phe Gly Gly Asn Cys Asp Val
290 295 300

Asp Val Ser Phe Met Tyr Leu Thr Phe Phe Leu Glu Asp Asp Asp Lys
305 310 315 320

Leu Glu Gln Ile Arg Lys Asp Tyr Thr Ser Gly Ala Met Leu Thr Gly
325 330 335

Glu Leu Lys Lys Ala Leu Ile Glu Val Leu Gln Pro Leu Ile Ala Glu
340 345 350

His Gln Ala Arg Arg Lys Glu Val Thr Asp Glu Ile Val Lys Glu Phe
355 360 365

Met Thr Pro Arg Lys Leu Ser Phe Asp Phe Gln Lys Leu Ala Ala Ala
370 375 380

Leu Glu His His His His His His
385 390



<210> 17 
<211> 6 
<212> PRT 

<213> Homo sapiens 
<400> 17 

Glu Leu Arg Val Ser Tyr 
1 5 



<210> 18 
<211> 6 
<212> PRT 

<213> Escherichia coli 
<400> 18 

Glu Thr Val Gln Glu Trp
1 5 



<210> 19 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 19 

Ser Ala Lys Glu Leu Arg Cys Gln Cys
1 5



<210> 20
<211> 11 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Ala Ser Val Ala Thr Glu Leu Arg Cys Gln Cys
1 5 10



<210> 21 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 21 

Ala Glu Leu Arg Cys Gln Cys
1 5 



<210> 22 
<211> 58 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Gly Asp Glu Lys Lys Ala Lys Glu Lys Ile Glu Lys Lys Gly Glu Lys
1 5 10 15

Lys Glu Lys Lys Gln Gln Ser Ile Ala Gly Ser Ala Asp Ser Lys Pro
20 25 30

Ile Asp Val Ser Arg Leu Asp Leu Arg Ile Gly Cys Ile Ile Thr Ala
35 40 45

Arg Lys His Pro Asp Ala Asp Ser Leu Tyr 
50 55 



<210> 23 
<211> 58 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Pro Ala Leu Lys Lys Leu Ala Ser Ala Ala Tyr Pro Asp Pro Ser Lys
1 5 10 15

Gln Lys Pro Met Ala Lys Gly Pro Ala Lys Asn Ser Glu Pro Glu Glu
20 25 30

Val Ile Pro Ser Arg Leu Asp Ile Arg Val Gly Lys Ile Ile Thr Val
35 40 45

Glu Lys His Pro Asp Ala Asp Ser Leu Tyr 
50 55 



<210> 24 
<211> 7 
<212> PRT 

<213> Homo sapiens
<400> 24

Arg Val Gly Lys Ile Ile Thr
1 5 



<210> 25 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 25 

Arg Ile Gly Cys Ile Ile Thr
1 5 



<210> 26 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 26 

Arg Ile Gly Arg Ile Ile Thr
1 5 



<210> 27 
<211> 7 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 27 

Arg Val Gly Arg Ile Ile Lys
1 5 



<210> 28 
<211> 7 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 28 

Arg Val Gly Phe Ile Gln Lys
1 5 



<210> 29 
<211> 7 
<212> PRT 
<213> Bos taurus 

<400> 29 

Arg Val Gly Lys Val Ile Ser
1 5 



<210> 30 
<211> 7 
<212> PRT 

<213> Mus musculus 
<400> 30 



Arg Ile Gly Cys Ile Val Thr
1 5 



<210> 31 
<211> 7 
<212> PRT 

<213> Mesocricetus auratus 
<400> 31 

Arg Ile Gly Arg Ile Val Thr
1 5 



<210> 32 
<211> 7 
<212> PRT 
<213> Ovis aries 

<400> 32 

Arg Ile Gly Cys Ile Ile Thr
1 5 



<210> 33 

<211> 7 

<212> PRT 

<213> Calcarea sp. 

<400> 33 

Arg Ile Gly Arg Ile Thr Ser
1 5 



<210> 34 

<211> 7 

<212> PRT 

<213> A. aeolicus 

<400> 34 

Arg Val Ala Lys Val Leu Ser 
1 5 



<210> 35 
<211> 7 
<212> PRT 

<213> Escherichia coli 
<400> 35 

Arg Val Gly Lys Ile Val Glu
1 5 



<210> 36 
<211> 7 
<212> PRT 

<213> Escherichia coli 
<400> 36 

Arg Val Ala Leu He Glu Asn 
1 5



<210> 37
<211> 7 
<212> PRT 

<213> Haemophilus influenzae 
<400> 37 

Arg Val Ala Lys Val Leu Lys 
1 5 



<210> 38 
<211> 7 
<212> PRT 

<213> Bacillus subtilis 
<400> 38 

Arg Val Ala Glu Val Ile Glu
1 5 



<210> 39 
<211> 7 
<212> PRT 

<213> B. stearothermophilus 
<400> 39 

Arg Val Ala Glu Val Val Gln
1 5 



<210> 40 
<211> 7 
<212> PRT 

<213> Thermus thermophilus 
<400> 40 

Arg Val Ala Glu Val Leu Ala 
1 5 



<210> 41 
<211> 6 
<212> PRT 

<213> Escherichia coli 
<400> 41 

Val Gly Glu Val Val Glu 
1 5 



<210> 42 
<211> 6 
<212> PRT 

<213> Bacillus subtilis 



<400> 42 

Ile Gly His Val Leu Glu
1 5



<210> 43
<211> 6 
<212> PRT 

<213> Synechococcus sp. 
<400> 43 

Val Gly Arg Val Leu Glu 
1 5 



<210> 44 
<211> 6 
<212> PRT 

<213> Thermus thermophilus 
<400> 44 

Phe Ala Arg Val Leu Glu 
1 5 



<210> 45
<211> 85 
<212> PRT 

<213> Homo sapiens 
<400> 45 

Met Ser Tyr Lys Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp Cys Pro
1 5 10 15

Pro Gly Asn Pro Ala Pro Thr Ser Asn His Gly Pro Asp Ala Thr Glu
20 25 30

Ala Glu Glu Asp Phe Val Asp Pro Trp Thr Val Gln Thr Ser Ser Ala
35 40 45

Lys Gly Ile Asp Tyr Asp Lys Leu Ile Val Arg Phe Gly Ser Ser Lys
50 55 60

Ile Asp Lys Glu Leu Ile Asn Arg Ile Glu Arg Ala Thr Gly Gln Arg
65 70 75 80

Pro His His Phe Leu 
85 



<210> 46 

<211> 85 

<212> PRT 

<213> Bos taurus 

<400> 46 

Thr Ser Tyr Lys Ala Ala Thr Gly Glu Asp Tyr Lys Val Asp Cys Pro 
1 5 10 15

Pro Gly Asp Pro Ala Pro Glu Ser Gly Glu Gly Leu Asp Ala Thr Glu
20 25 30

Ala Asp Glu Asp Phe Val Asp Pro Trp Thr Val Gln Thr Ser Ser Ala
35 40 45

Lys Gly Ile Asp Tyr Asp Lys Leu Ile Val Arg Phe Gly Ser Ser Lys
50 55 60

Ile Asp Lys Glu Leu Val Asn Arg Ile Glu Arg Ala Thr Gly Gln Arg
65 70 75 80 

Pro His Arg Phe Leu 
85 



<210> 47 
<211> 85 
<212> PRT 

<213> Mus musculus 
<400> 47 

Met Ser Tyr Lys Ala Ala Met Gly Glu Glu Tyr Lys Ala Gly Cys Pro 
1 5 10 15

Pro Gly Asn Pro Thr Ala Gly Arg Asn Cys Asp Ser Asp Ala Thr Lys
20 25 30

Ala Ser Glu Asp Phe Val Asp Pro Trp Thr Val Arg Thr Ser Ser Ala
35 40 45

Lys Gly Ile Asp Tyr Asp Lys Leu Ile Val Gln Pro Gly Ser Ser Lys
50 55 60

Ile Asp Lys Glu Leu Ile Asn Arg Ile Glu Arg Ala Thr Gly Gln Arg
65 70 75 80 

Pro His Arg Phe Leu 
85 



<210> 48 
<211> 85 
<212> PRT 

<213> Oryctolagus cuniculus 
<400> 48 

Thr Ser Tyr Lys Glu Ala Met Gly Glu Asp Tyr Lys Ala Asp Cys Pro 
1 5 10 15

Pro Gly Asn Ser Thr Pro Asp Ser His Gly Pro Asp Glu Ala Val Asp
20 25 30

Asp Lys Glu Asp Phe Val Asp Pro Trp Thr Val Arg Thr Ser Ser Ala
35 40 45

Lys Gly Ile Asp Tyr Asp Lys Leu Ile Val Gln Phe Gly Ser Ser Lys
50 55 60

Ile Asp Lys Glu Leu Val Asn Arg Ile Glu Arg Ala Thr Gly Gln Arg
65 70 75 80

Pro His Arg Phe Leu
85



<210> 49
<211> 86 
<212> PRT 

<213> Homo sapiens 
<400> 49 

Ile Ser Tyr Gln Gly Arg Ile Pro Tyr Pro Arg Pro Gly Thr Cys Pro
1 5 10 15

Gly Gly Ala Phe Thr Pro Asn Met Arg Thr Thr Lys Glu Phe Pro Asp
20 25 30

Asp Val Val Thr Phe Ile Arg Asn His Pro Leu Met Tyr Asn Ser Ile
35 40 45

Tyr Pro Ile His Lys Arg Pro Leu Ile Val Arg Ile Gly Thr Asp Tyr
50 55 60

Lys Tyr Thr Lys Ile Ala Val Asp Arg Val Asn Ala Ala Asp Gly Arg
65 70 75 80 

Tyr His Val Leu Phe Leu 
85 



<210> 50 
<211> 86 
<212> PRT 

<213> Mus musculus 
<400> 50 

Ile Ser Tyr Gln Gly Arg Ile Pro Tyr Pro Arg Pro Gly Thr Cys Pro
1 5 10 15

Gly Gly Ala Phe Thr Pro Asn Met Arg Thr Thr Lys Asp Phe Pro Asp
20 25 30

Asp Val Val Thr Phe Ile Arg Asn His Pro Leu Met Tyr Asn Ser Ile
35 40 45

Ser Pro Ile His Arg Arg Pro Leu Ile Val Arg Ile Gly Thr Asp Tyr
50 55 60

Lys Tyr Thr Lys Ile Ala Val Asp Arg Val Asn Ala Ala Asp Gly Arg
65 70 75 80 

Tyr His Val Leu Phe Leu 
85 



<210> 51 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 51 

Ala Ala Ala Gly Glu Asp Tyr Lys Ala Asp Cys Pro Pro Gly Asn Pro 
1 5 10 15

Ala Pro Thr Ser Asn His Gly Pro Asp Ala Thr Glu Ala Glu Glu Asp
20 25 30

Phe Val Asp Pro Trp Thr Val Gln Thr Ser Ser Ala Lys Gly
35 40 45 



<210> 52 
<211> 46 
<212> PRT 
<213> Bos taurus 

<400> 52 

Ala Ala Thr Gly Glu Asp Tyr Lys Val Asp Cys Pro Pro Gly Asp Pro 
1 5 10 15

Ala Pro Glu Ser Gly Glu Gly Leu Asp Ala Thr Glu Ala Asp Glu Asp
20 25 30

Phe Val Asp Pro Trp Thr Val Gln Thr Ser Ser Ala Lys Gly
35 40 45 



<210> 53 
<211> 46 
<212> PRT 

<213> Mus musculus 
<400> 53 

Ala Ala Met Gly Glu Glu Tyr Lys Ala Gly Cys Pro Pro Gly Asn Pro 
1 5 10 15

Thr Ala Gly Arg Asn Cys Asp Ser Asp Ala Thr Lys Ala Ser Glu Asp 
20 25 30 

Phe Val Asp Pro Trp Thr Val Arg Thr Ser Ser Ala Lys Gly 
35 40 45 



<210> 54 
<211> 46 
<212> PRT 

<213> Oryctolagus cuniculus 
<400> 54 

Glu Ala Met Gly Glu Asp Tyr Lys Ala Asp Cys Pro Pro Gly Asn Ser 
1 5 10 15

Thr Pro Asp Ser His Gly Pro Asp Glu Ala Val Asp Asp Lys Glu Asp 
20 25 30 

Phe Val Asp Pro Trp Thr Val Arg Thr Ser Ser Ala Lys Gly 
35 40 45 



<210> 55 
<211> 41 
<212> PRT 

<213> Mus musculus 



<400> 55 

Ala Phe Ala Gly Glu Asp Phe Lys Val Asp Ile Pro Glu Thr His Gly
1 5 10 15

Gly Glu Gly Thr Glu Asp Glu Ile Asp Asp Glu Tyr Glu Gly Asp Trp
20 25 30 

Ser Asn Ser Ser Ser Ser Thr Ser Gly 
35 40 



<210> 56 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<400> 56 

Met Gly Asp Ala Pro 
1 5 



<210> 57 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<400> 57 

Ser Asn His Gly Pro 
1 5 



<210> 58 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<400> 58 

Ser Ala Lys Gly Ile
1 5 






